THE EFFECT OF NON-SMOOTH PAYOFFS ON THE PENALTY 
APPROXIMATION OF AMERICAN OPTIONS 



Abstract. This article combines various methods of analysis to draw a comprehensive 
picture of penalty approximations to the value, hedge ratio, and optimal exercise strategy 
of American options. While convergence of the penalised solution for sufficiently smooth 
obstacles is well established in the literature, see e.g. [H [34], sharp rates of convergence 
and particularly the effect of gradient discontinuities (i.e. the omni-present 'kinks' in option 
payoffs) on this rate have not been fully analysed so far. This effect becomes important 
not least when using penalisation as a numerical technique. We use matched asymptotic 
expansions to characterise the boundary layers between exercise and hold regions, and to 
compute first order corrections for representative payoffs on a single asset following a diffusion 
or jump-diffusion model. Furthermore, we demonstrate how the viscosity theory framework 
in |15| can be applied to this setting to derive upper and lower bounds on the value. In 
a small extension to [4], we derive weak convergence rates also for option sensitivities for 
convex payoffs under jump-diffusion models. Finally, we outline applications of the results, 
including accuracy improvements by extrapolation. 
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An American option is a financial instrument that gives its holder the right to claim a specified 
payoff on an asset at any time up to a certain date. Pricing an American option involves 
determining an optimal exercise strategy in addition to the price itself. 

There are two main equivalent approaches to formulating this problem: a probabilistic ap- 
proach based on stopping times, and a deterministic one based on a linear complementarity 
problem (free boundary problem) . The optimal stopping formulation in diffusion models was 
first introduced in [3] and |16j ; a concise outline can be found in |25j. For simplicity, we 
discuss first the Black-Scholes setting (cf. [5]), i.e., the stock price follows 



where a is the volatility, \x the drift rate, and W a standard Brownian motion. 

In jS], it is described how an American option in this setting can be priced using a linear 
complementarity problem (LCP) 
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1. Introduction 




dSt/S t = fidt + adW t 




min(-£ B sV, V- *) = 
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where \E f is the payoff and C is the Black-Scholes operator 

(1-3) £bs^ := ^ + b'S 2 ^ + (r - q)S^ - rV, 

where r is the risk-free interest rate and q a continuously paid dividend yield. The relation 
between stopping times and PDEs in this context is further analysed in |17j . 

In this paper, we are concerned in particular with the effects of the payoff function on a 
so-called penalty approximation to the value of such an option. Penalty approximations are 
useful both for the analysis [UGH] and numerical analysis (15] of the limiting problem, but also 
lend themselves to arguably the most efficient numerical approximation methods presently 
available for American option valuation jlO] . 

Penalisation of (parabolic) variational inequalities is classical (cf. j3j). The canonical penalty 



approximation of (1.2) is 



(1.4) -CbsV* = *max(^-V e ,0) 

for e > (cf. [3]). The penalty term on the right-hand side is only active when V € < and 
then it serves to push V upwards towards the payoff. 

In the context of American options, in chronological order, [321 EH EH] study the penalisation 
error for the Black-Scholes model and different penalty terms, and pQ uses penalisation im- 
plicitly to solve a calibration problem. [TTJ [TU1 [Ml O E7j introduce penalty approximations 
as a means of solving the discretised variational inequality. 



Effect of the Penalisation Error on Pricing and Hedging. This paper analyses the 
accuracy of a penalty approximation to the value function for relevant payoff functions. To 
do this, we first address the question of what a relevant measure of accuracy should be, which 
clearly depends on what the solution will normally be used for. 

Hedging American options requires knowledge of the hedge ratio, i.e. the amount of stocks 
held short in the hedging portfolio per unit long position in the option before it is exercised. 
In the complete market case of the Black-Scholes model, the hedge ratio is the so-called Delta, 

A t = (dv/dS)(s t ,t). 

Because we do not know the exact option value, but only its penalty approximation, we are 
exposed to three sources of error if we, say, buy and hedge an American option: 

a) We bid the - lower, as we shall see - price V e (So, 0) instead of V(So, 0) for the option 
at the outset. 

b) We hedge with the wrong hedge ratio 

dV e 
^ ~ ~dS~ 

instead of the exact Delta A^. 

c) We exercise at the wrong time 

r e :=mf{t:V £ (S t ,t)<y(S t )}, 
which is before the optimal exercise time r, since V e (St,t) < V(St,i). 
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The value of our hedge portfolio at t < t € is, by a classical replication argument (e.g. [25]), 



Jo 



where dW® = dWt + {p — r)/adt is the increment of a Brownian motion under the risk- neutral 
measure Q. 

Consider the stopping time r e A t. Then, by the Optional Stopping Theorem, 

¥P[X^ M — X T e M ] = Xq — Xq 

and, by Ito isometry, 

"T E At 



VQ[X^ M - X T c M ] = E / a 2 S 2 u ^- u \K - A u ) 2 du 

Jo 

fT 

< E / a 2 S 2 u e 2{t - u) {A e u - A u ) 2 du 
Jo 



T 







T roc / aire 31/ \ 2 



£ tV T / / l>(S {l A):S.u)S^ d ^(S,u)-~(S.u)) dSdu 



o Jo 

f T f°° I 'dV e dV \ 2 

where p(5o,0; S,u) is the transition density from So at time to S at time u, and the last 
inequality follows because p S 2 is bounded. The variance of the replication error at any time 
prior to exercise is therefore controlled by the squared H 1 distance between the true and 
penalty value function, 

pT poo / f)ye ay \ 2 

l|y£ " y|l? = i, Jo \~dS iS,U) ~dS {S,U) ) +( ve ^ u )- V ^ u )) 2dSdu ^ 

which is one of the error measures we will consider. Next, the loss incurred by exercising too 
early is 

(1.5) V(S T e,r £ ) - V'{S T ,^) = V(S T .,T*) - V(S T e), 

which is the (positive) difference between true and penalised solution at the (sub-optimal) 
penalty exercise boundary. Viewed differently, by hedging with A e we are replicating an 
option which is exercised not at the optimal exercise time, but at the crossing time of an 
approximate exercise boundary. An alternative measure of error is therefore the maximum 
distance 

||^ e -^||oo= sup \V(S,t)-V e (S,t)\, 

0<t<T,0<S 

which is also an upper bound for \Xo — Xq\ = V(Sq, 0) — V e (So,0). We will study the 
convergence in this norm also. 
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Extension to Jump Models. The suggestion to allow the asset to jump occasionally to 
reflect the possibility of sudden changes in the market was first made in [20J; an extensive 
overview and detailed discussion of jump models and their use in modern mathematical 
finance can be found in [6]. The pricing of American options in the presence of jumps has 
been developed and studied in |34l I22j. which remain the main references on the topic. 

We consider models where the underlying asset follows a jump- diffusion process, 

(1.6) dSt/St = Lidt + adW t + (J-l)dN t , 

where J is a random jump amplitude lying in [0, oo), and N a compound Poisson process 
with jump rate A > 0. The special case A = recovers the Black-Scholes model. 

Under the assumption that jump risk is unpriced, the value of an American option under 
jump diffusion can still be described by an equation of the type (1.2), but with 

BV B 2 V BV 

(1.7) C BSJ V := + W&Qp +(r~q- ^)S^ - rV + XE[V(JS, t) - V(S, t)}, 

where the expectation is taken with respect to the jump size J, for fixed S, and u = E[J — 
1]. This can be re- written as a partial integro-differential equation (PIDE) in terms of the 
probability density function g of J via 

roc 

E[V(JS, t) - V(S, t)\ = / V(SJ, t)g(J) dJ - V(S, t). 

Jo 

It will also be useful to consider the PIDE in log-coordinates, x = log (5/ So), u(x,t) = 
V(So exp(a;) , t) , where 

Bu B 2 u Bu roo 

(1.8) Cu = — + i<7 2 — -K + (r-q-uj\-a 2 /2)- ru+X / u(x + z, t)v(z) dz - u(x, t) 

Bt A Bx z Bx I j_ 00 

in (|1.2[) and (j>(z) = <&{Sq exp(z)) is the new payoff and v the density of Z = log (J). The 



pricing equation is then still fll.2b, the penalised equation (1.4), now with the operator rtl.Sh. 



We will see that the inclusion of finite activity jumps does not alter the properties of penalty 
approximations qualitatively, and all the general results later on in the paper are derived for 
this class of models. Some of the specific examples used for numerical examples and for which 
we derive asymptotic expansions use the Black-Scholes model for ease of exposition. It will 
be stated clearly at the start of all sections where this is the case. 

Structure of this Paper. In Section 2, we discuss a few representative examples of typical 
payoffs and present numerical results as motivation for the following analysis. In Section 3, 
we derive the leading order corrections to the penalty solution and the exercise boundary by 
matched asymptotic expansions, for the American put and butterfly. This analysis is useful 
because the localised structure of the error, especially for the derivative, is not visible from the 
more global functional analysis to follow and, to our knowledge, this is the first study of this 
behaviour. Section 4 generalises the convergence order of the penalisation error of the value 
to more general classes of convex (order e) and non-convex (order e 1 / 2 ) payoffs, and gives 
sharp upper and lower bounds on the solution, following the framework of [15] and extending 
it to jump processes. Section 5 derives H 1 errors, showing that the rate e 1//2 derived in [4| 
also holds under jump-diffusions and for non-smooth but convex obstacles. Finally, in Section 
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6, we discuss the results and their applications; in particular, we show how extrapolation can 
be used for accuracy improvement. 

2. Different Payoffs and Their Implications 

Part of the appeal of penalty methods as a computational tool is that the resulting algorithms 
do not depend on the shape of the payoff. In contrast to the formulation as free boundary 
problem (e.g. 'front-fixing' methods), the topology of exercise and continuation regions is 
irrelevant for the definition of the penalty approximation and (iterative) solution algorithms 
based on it. 

We will now illustrate how the shape and regularity of the payoff does, however, influence the 
approximation error. 



Example Payoffs and their Exercise Strategies. Two typical payoffs are the standard 
put payoff 

(2.1) *(£) = max(if - S, 0), 
with strike K > 0, and a butterfly spread 

(2.2) = max(V -a\S- K\,0), 

for some a,Vo > 0. We also consider an academic example of a 'modified' put, which is a 
sum of a put and a butterfly spread, and has a piecewise linear payoff. 

For American options, typically a exercise boundary determines the asset price(s) at which, 
for a given point in time, the optimal policy switches from holding the option to exercising 
it. 

Figures [TJ [2j and [3] show value functions with their free boundaries for different payoffs. 
For illustrative purposes, we use a Black-Scholes framework with no dividends, interest rate 
r = 0.05, volatility a = 0.4, and maturity T = 1. We will see later that jumps do not change 
the results qualitatively. 

We discuss the three examples in turn. 

(1) For the standard put, a smooth free boundary S*(t) separates an exercise region 
S < S*(t) from a hold region S > S*(t), and decreases strictly as we move away from 
expiry. Denote this vanilla American put value by P(S,t). Note that, at S = S*(t), 

BP B 2 P 2rK 

(2.3) P(S-( t) , t ) = K- S - (t) , _ = - 1, l;m^ = ^. 

see e.g. [8]. Before expiry, the solution is continuously differentiable with a jump in 
the second derivative (the Gamma) at the free boundary. 

(2) For the butterfly spread, short before expiry, the option value is greater than the 
payoff on the call-like side S < K, and is similar to before on the put-like side S > K 
with an exercise boundary between 100 and 150. The solution smooths the convex 
kinks which the payoff has at 50 and 100; however, at all times, the solution has a 
kink at 100. As we move away from expiry, the free boundary decreases strictly from 
150 at first and then, having reached S = 100, remains there. 
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American Put: Solution at different Times 





— At Expiry 

Time to Expiry = 0.4 

— Time to Expiry = 0.9 
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Figure 1. The value of an American put at different points in time (above) and the 
evolution of the corresponding free boundary (below). Before expiry, where it equals 
the payoff, the value function looks smooth. The boundary is strictly decreasing as we 
move away from expiry. 



(3) For the modified put, whose payoff is defined in the caption of Figure[3| there is again a 
single free boundary which separates an exercise region for small 5 from a hold region. 
Before expiry, the solution smooths the convex kink which the payoff has at about 137, 
but, near expiry, it still has a kink at 105; however, far away from expiry, the solution 
looks smooth. As we move away from expiry, the free boundary decreases strictly at 
first, stagnates (until smooth pasting is reached) and then decreases strictly again. A 
discussion on this 'waiting time' phenomenon in the context of diffusion problems can 
be found in [23], to which the present case adds a further example. 



Numerical Penalisation Error. We now analyse numerically the penalty convergence for 



these three examples. We use a Crank-Nicolson finite difference scheme to discretise (1.4) and 
solve the resulting non- linear discrete system by a Newton iteration (cf. [TO]). We compute 
a reference solution to the original LCP using PSOR (cf. [7]). We measure spatial errors 
pointwise in the maximum norm, and similarly for the derivative, which is approximated by 
finite differences of the numerical solution. We measure the convergence rate in the penalty 
parameter e from (1.4) as e — >■ by regression of the errors for small e. Generally, we use very 



fine time and space grids to suppress possible discretisation errors. 

Figure [4] shows the numerically computed penalisation error for the standard put as a function 
of S and t. It appears that the error is constant in the exercise region, jumps to about half 
this value at the exercise boundary, and decays for large S. The irregular behaviour of the 
error surface close to the exercise boundary is due to the movement of the exercise boundary 
between time steps relative to the grid points. Figure [5] is the corresponding picture for the 
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American Butterfly Spread: Solution at different Times 





— At Expiry 
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Figure 2. The value of an American butterfly spread at different points in time 
(above) and the evolution of the corresponding free boundary (below). (Possibly hard 
to see in the plot, there is only one free boundary which lies between 100 and 150.) At 
all times, the value function has a kink at S=100. The boundary decreases strictly at 
first and then stagnates. 

penalisation error in the first S'-derivative. The error appears localised in a very narrow region 
around the exercise boundary. The jagged shape of the surface results again from an interplay 
of the penalisation error and discretisation. For the chosen time step and mesh width, the 
width of the region of large error is small compared to the grid size, and, from one time step 
to the next, has a different location relative to its nearest grid points. For time steps, where 
the location of the maximum is close to a grid point, the plotted spike is large, whereas if the 
maximum lies between grid points, it is small. 

For the butterfly, as seen from Figure [6j there is an asymmetry in the penalisation error 
between the call-like side, where the error grows more steeply in time-to-expiry, and the put- 
like side, where the error is flat up to the the point in backward-time where the exercise 
boundary hits the top of the payoff, and from then on increases for larger time-to-maturity. 
The error is largest at the strike, constant in time, and decays rapidly on either side. 

The penalisation error for the modified put is shown in Figure [7} 

Table [T] shows the convergence rates in the penalty parameter e for the three payoffs. For 
the standard put, we find results for the spatial errors in V e and dV e /dS which appear con- 
sistent with O(e) and 0(e 1//2 ), respectively. For the butterfly spread, we find 0(e 1 ^ 2 ) for the 
spatial error in V e , but convergence in 8V e /dS seems to be very slow. Looking at the solu- 
tions (cf. Figures[TJ[2]), one readily suspects the concave kink of the butterfly spread, which is 
prevalent in the solution at all times, to be the reason for the slower convergence. This obser- 
vation is further supported by the fact that, for the modified put, we find convergence rates 
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American Put with additional Kink: Solution at different Times 
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American Put with additional Kink: Free Boundary 
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Figure 3. An American option on a sum of a put and a butterfly payoff, i.e., a put 
with an additional concave kink at S=105. We see the value at different points in time 
(above) and the evolution of the corresponding free boundary (below). Near expiry, the 
value function has a kink at S=105, but, further away from expiry, it looks smooth. 
The boundary decreases strictly at first, stagnates, and then decreases strictly again. 



Penalty Approximation Put Butterfly Spread Modified Put 
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Order in \ V — V e \ 


1.00 
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Order in | dV/dS - dV e /8S\ 


0.55 


0.57 


0.07 


0.08 


0.07 


0.06 


0.61 



Table 1. At different points t in time, we measure the convergence rates in \ V(-,t) — 
V e (-,t)\ and \dV/dS(-,t) — dV e /dS(-,t)\, where V denotes the true solution, as e — >• 0. 
For the put and the butterfly spread, the rates are the same at all times. For the 
modified put, the convergence rate in dV £ /dS improves hugely when expiry is far into 
the future (at which point the free boundary has overcome the concave kink). 



comparable to the rates of the butterfly spread near expiry, but, further away from expiry, 
where the concave kink has been smoothed out (cf. Figure[3]) , convergence rates improve to 
0(e 1//2 ) for the spatial errors in V € and dV e /dS. 

In the next section, we will use matched asymptotic expansions for small e to explain this 
behaviour and to derive the leading order corrections to the penalty solution. 



3. Approximation by Matched Asymptotic Expansions 
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Penalisation Error American Put 




Figure 4. Local structure of the penalisation error for the put. The error is largest, 
and roughly constant, in the exercise region, and decays rapidly over a small layer 
around the exercise boundary. 
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Figure 5. Local structure of the penalisation error for the put delta. The error is 
largest in a small layer around the exercise boundary. 
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Penalisation Error for Butterfly Spread 




Figure 6. Local structure of the penalisation error for the butterfly spread. The 
error is largest in a narrow region around the kink of the payoff. It is negligible on 
the put-like side up to the point where it is optimal not to exercise the option. 

3.1. Asymptotics for Put. We first consider an American put option on an asset in the 
Black-Scholes model with no dividends. The penalty equation is written 

(3.1) C B sV e = --max(K-5-y e ,0) = ~ max(K - S - V e , 0), 

e o z 

where we have introduced a dimensionless parameter 5 = oeh < 10 

There is a three-region structure, with 'outer' regions above and below an 'inner' region which 
spans the point at which the penalty solution crosses the payoff. We will see later that 5 is the 
characterisitic width of the inner region between the exercise and hold regions of the option. 
Initial (i.e. near expiry) transients are ignored (this means both the penalty term transient 
and the American-put transient). 

3.1.1. Outer region S > S*(t) ('hold'). First, write 

(3.2) V e (S,t) = K - S + W e (S,t) 
and expand 

W e {S, t) ~ W (S, t) + SWi(S, t) + 5 2 W 2 (S, t) + • • • . 

We expect Wo(S,t) = P(S,t) — (K — S) and Wi(S,t) = (because smooth pasting always 
leads to a smaller error than a barrier-type 'pinned' condition, cf. the asymptotic results on 
Bermudan options and discrete barrier options in [28l [13] ) . Wq satisfies CbsWq = —rK and 



We could have used r or T instead to scale 5, with the same eventuel answer. The choice we have made 
makes the intermediate calculations simpler. We also keep writing V e to keep the notation simple. 
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Penalisation Error American Put with additional Kink 
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Figure 7. Local structure of the penalisation error for the modified put. It shows a 
combination of features of the put and butterfly payoff, and a decay in time resulting 
from the waiting time phenomenon. 



True solution 
Penalty solution 




Figure 8. Schematic of a put option solution and blow-up of inner region. True 
solution is the solid curve, penalty solution is dashed. 



all Wi for i > satisfy the homogeneous Black-Scholes PDE, because the penalty is not 
active. 



Introduce the inner variable 

S = S*(t)(l + 5x). 
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Then Taylor-expanding W(S, t) about S = S*(t) and writing the result in terms of x gives 
the outer expansion expanded in inner variables as 

w~ ws(t) 

+ 5(xS*(t)W Q * s (t) + W?(t)) 
+ 5 2 (^x 2 S*(t) 2 W^ ss (t) + xS*(t)W? s (t) + W* 2 {t) 

(3.3) +-.., 
where 

dWn 

W *(t) = W (S*(t),t), WZs(t) = ^f(S*(t),t) etc. 

are functions of t alone and as-yet unknown. Hence, in the absence of spatial boundary 
conditions, we can do no more in this region for now. 

3.1.2. Outer region: S < S*(t) ('exercise'). This is the region below the exercise point. The 
penalty equation becomes 

2 

CbsW' = rK+^W £ . 

Simple iteration gives that 

(3.4) W e ~ -5 2 rK/a 2 + 0{5 i ). 
This dictates the scaling in the inner region. 

3.1.3. Inner region. Make the change of variables to x and t, and expand 

(3.5) W e (S,t) = w e (x,t) 

(3.6) ~ wo(x,t) + 5wi(x, t) + 5 2 W2(x,t) + • • • . 
The penalty equation becomes 

dw e S* ^ . ,dw e 1 2 (l + 5x) 2 d 2 w e l + 5xdw e 



dt 5S* v ' dx 2 5 2 dx 2 5 dx 

x > x*, 



(3.7) =rK + 



■2 



JlW X < x , 



where S* = d5*/dt, and with w e = and dw e /dx continuous at x = x* (i.e., x* is the crossing 
point of the penalty solution). 



3.1.4. Matching. As x — > — oo, we have (cf. (3.4)) 



wo — > 0, w\ — > 0, u>2 — > —rK/a 2 



and as x — > +oo we have (compare (3.3)) 

wo(x,t)~W3(t) + o(l), 

wi(x,t) ~ xS*(t)Wo* s (t) + W{{t) + o(l), 

w 2 (x, t) ~ ^ 2 5*(t) 2 ^o55W + ^)W?s(t) + W* 2 {t) + o(l) 
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The largest terms in (3.7) are 0(l/<5 2 ). When we substitute the expansion (3.6) in and collect 
terms, we get, at 0(1/5*), 

1 2 9 2 w _ J x > x*, 

2 dx 2 1 a 2 wo x < x*, 

and the only solution that vanishes at x = — oo, has continuous first derivative at x = x* , 
and tends to a constant at x = +oo, is wo(x,t) = 0. This tells us that 

W$(t) = 

as expected. Because Wq(3, t) is the difference between the vanilla value of the put and 
the payoff, its S-derivative vanishes at S = S*(t) — this is smooth pasting. (In more detail, 
because Wq(S, t) has the right value at S = S*(t) and the right payoff, uniqueness for solutions 
of the BSPDE in a parabolic domain tells us that it is the vanilla put value.) Hence, Wg S (i) = 
0. Now, at 0(1/5) in pTr] , we get 

1 2 d 2 w\ JO x > x*, 

2 dx 2 1 a 2 w\ x < x*. 

As WQ S (t) = 0, wi(x,t) has no linear term at x = +oo, and so, by the same argument 
as above, it vanishes too, confirming that the inner scaling for W is indeed 0(5 2 ). Hence, 
W*(t) = 0, and we can return to the outer region S > S*(t) to show that Wi(S, t) = (zero 
payoff, zero value on S = S*(t)). Now we come to the first non-trivial term. At O(l) in ( |3.7[ ), 
we have 

i od 2 W2 T ^ I X > X*, 

2 a ~~a~2~ =rK + \ 2 

z ox 1 \cr z Wo x < x*. 

For x < x*, the solution that tends to —rK/a 2 at — oo and vanishes at x = x* is 
(3.8) w^{x, t) = rK (V2(*-z*M) _ i) / a 2 . 



For x > x*, the solution that vanishes at x = x* and whose derivative matches (3.8) is 

rK 



(3.9) wj(x, t) = — (x - x\t)Y + rKV2/a 2 (x - x*(t)). 



Now comes the key point. From the matching, we now know that 

w+(x,t) ~ lx 2 S*(t) 2 WZ ss (t) + W 2 *(t), x -> oo. 



There is no linear term because Wi(S, t) = 0. Comparing with (3.9), we find that 

rK/a 2 = ^S*(t) 2 Wo SS (t) (coefficient of x 2 ), 

-2rKx*(t) /a 2 + rK^l/o 2 = (coefficient of x), 

rKx* (t) 2 /a 2 - V2rKx*{t) /a 2 = W 2 * (t) (constant coefficient). 

The first of these confirms the boundary Gamma of the vanilla put. The second gives 

x*(t) = 1/V2. 

The third gives 

W*(t) = -\rK/a 2 . 
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Penalisation Error 


Computed 


Predicted 


Relative Difference 


Value in Exercise Region 
Value in Hold Region 
Exercise Boundary 


4.9975e-04 
2.5070e-04 
0.0174 


5.0000e-04 
2.5000e-04 
0.0165 


-5.0075e-04 
0.0028 
0.0516 



Table 2. For the American put, the maximum penalisation error in exercise and hold 
regions separately, and the error in the exercise boundary, for = 100, a = 0.4, 
r = 0.05, K = 100, T = 1, hence 8 = 0.04. The numerical result is compared with the 
first order correction from the asymptotic analysis. 



In original variables, the crossing point is at 

S = 5*(t)(l + 8x\t)) 

= S*(t) (l + ^/V^j 

as 5 2 = <7 2 e, and the boundary value of the correction is 

5 2 W2(t) = -\rKe 

(independently of a). 

The correction is interpretable as the value of an option with zero payoff at maturity which 
pays a fixed amount when (if) the stock crosses S*(t) from above. Interestingly, the continuity 
correction for a Bermudan option has the same boundary value if we set 5 2 = a 2 AT/2, where 
AT is the interval between exercise dates |13] . 

3.1.5. Numerical verification. Table [2] compares the corrections based on W£ and x* against 
the numerically computed penalisation error and finds excellent agreement. 

3.2. Butterfly Spread. Still in the Black-Scholes framework without dividends, consider 
now a butterfly spread with payoff 

max(V + ai(S-K),0), S < K, 
max(Vb + a 2 {K- 5),0), S > K, 

where Vo, a, > 0. Denote again the penalty value by V e (S, t), the true value B(S, t). Consider 
the situation where B(S, t) has a free boundary on the put-like bit of the payoff (S > K) but 
on the call-like bit, the value of B{K,t) is anchored to Vq. This certainly happens for short 
times before expiry (as the Black-Scholes operator on the payoff is positive). The former is 
analysed as before so we focus on the region around the convex kink at S = K. 

The inner variable is now S = K{1 + 5x), the outer solution for S < K is of the form 

V e (S,t) = B{S,t) + 5V 1 (S,t) + ■ ■ ■ 
and its inner expansion near S = K is 

V + 5(xB&t) + V?(t)) + — , 
where B10) = dB/dS (K,t) and V*(t) = V\(K,t). The payoff in inner variables is 

Vo + 5a±Kx, x < 0, 
Vq — ba 2 Kx, x > 0. 
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0(SK) 




Figure 9. Schematic of a butterfly option solution and blow-up of the inner region 
near the peak. The true solution is the solid curve, the penalty solution is dashed. 



This suggests that the inner solution is of size 0(5), not 0(5 2 ), and this is consistent with 
the left-hand outer solution meeting the payoff at an angle (not smooth pasting). Write the 
inner expansion in the form 

V e (S,t)~V + Svi(x, «) + ••• • 

Also let x* (which is negative) be the point at which the penalty solution crosses the payoff. 
The leading order inner equation is 

i nd 2 vi o \vi — aiKx, x < 0, 
icr — — = a < 

z Ox 1 \vi + a 2 Kx, x > 0. 

The solution is C 1 at both x = and x = x*, where C 1 is the space of continuously differ- 
entiable functions. As x — > oo, v\ ~ — a 2 Kx because the solution is 0(5 2 ) in the 'exercise' 
region to the right of S = K. So, 



J a x Kx + a(t)K cosh(x v / 2) + b(t)K smh(x^/2), 
Vl ~ \ -a 2 Kx + a(t)Ke~ xV2 , 



x < 0, 
x > 0, 



for some a(t), b(t). This is continuous at x = 0, and continuity of dvi/dx at x = gives 

(3.10) ai + b(t)V2 = -a 2 - a(t)V2. 

Now, at x = x*, vi meets the payoff and joins onto the outer solution: 

v 1 (x\t),t) = a 1 Kx*(t), ^- = B : s (t), 

from which 

(3.11) a lX *{t) + a(i) cosh(x*(t) V2) + 6(t) sinh(x*(t) y/2) = a lX *(t), 

(3.12) ai + V2 (a(t) smh(x*(t)V2) + b(t) cosh(x*(t)v / 2) = B&t). 
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This, with (3.10), is three equations for a(t), b(t) and x*(t). From these, we readily find that 

♦AA 1 , "1 + «2 

X (t) = -= log ■ 



^2 b a x -B%ty 

This is clearly negative since we have < B* s < ol\. Also, it tends to — oo as B* s — > a± from 
below - that is, as (if) the free boundary moves away from S = K. 

We also derive the value V\(K,t) = a.\Kx*(t), which lets us compute the correction term, 
although not explicitly because of the complicated time dependence. 

3.3. American Put under Jump-Diffusion. We now extend the analysis under Black- 
Scholes to include jumps of relative size J, at the jump time of a compound Poisson process 
with rate A. We only do this for the standard put payoff to illustrate the extensions over 
Black-Scholes. The results are qualitatively very similar to the Black-Scholes case and the 
analysis suggest this will also be the case for other payoffs. In addition, we also account for 
continuously paid proportional dividends of rate q, where we assume q < r. (The solution for 
q > r is qualitatively different.) 



The penalised equation is defined as in (3.1), specifically, for the put under a jump model 

(3.13) C B sjV e = --max(K - S -V £ ,0) = ~ max(K - S - V e , 0), 

e o z 



where £bsj is defined in (1.7). Smooth pasting still holds for the vanilla American put value 
P(S,t), and applying these conditions just to the right of the exercise boundary now gives 
the boundary Gamma as 
(3-14) 

S| 5=5 , w = o^kfy 2 {rK ~ te + uX ) £r ® ~ mP(JS*(t),t)-p(s*(t),t)]) = : ^r*(t). 

Note that T*(t) depends on P(S,t) for all S > via the term E[P(JS*(t),t) - P(S*(t),t)]. 

3.3.1 . Outer region S > S*(t) ('hold'). It will be useful again to introduce W e {S,t) as in 
(3.2). As Se~ q ( T ~^ and Ke~ r ^ T ~ t ' > satisfy HbsjV = individually (in fact their difference is 
the value of a forward) , one gets 

£bsj^(S, t) = C BS jW e (S, t) - C BS j(S -K) = C B sjW*{S, t) - qS + rK. 

3.3.2. Outer region: S < S*(t) ('exercise'). With the above substitution, as W(S,t) < in 
this region by assumption, 

2 

£-BSjW e (S, t)=rK-qS+ ^W e (S, t). 

Inserting the expansion for W e (S, t), Wo(S, t) and Wi(S, t) vanish in the outer region S < S*(t) 
as before; however, when determining W2(S,t) in the 'exercise' region, we have to account 
for jumps into the other regions, particularly into the outer 'hold' region S > S*(t), where 
W e (S,t) will not be small. Thus, at 0(1), 

XE[W {JS,t) - W (S,t)] =rK-qS + a 2 W 2 {S,t), 

and, solving for W2(S,t), 

W 2 (S,t) = - ((rK - qS) - XE[P(SJ,t) - (K - JS)}) /a 2 

= - ((rK - qS) - XE[P(SJ, t) - P(S, t) + (J - l)S}) /a 2 . 
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It is seen from the above equations that W 2 (S,t) is negative and inreases with increasing S, 
from W 2 (0,t) = -rK/a 2 at the origin, i.e. the penalisation error is largest for 5 = 0, where 
it has a simple explicit value. This also follows from a maximum principle argument, as we 
will see in Section HI 




3.3.3. Inner region. Similar to (3.7), we now have 
. dw e S* r ,dw e 1 2 (1 + 5x) 2 d 2 w e 1 + Sxdw € 

+ E[W e (JS*(l + 5x),t) - W e (S*(l + 5x),t)} 

= rK -qS*(l + 6x) + 

The non-local term is written in terms of the outer solutions because it acts on the scale of 
the outer variables. Writing the expectation term as integral and expanding in 5 gives, at 
leading order, 

poo 

5S*x / JW 6 s (JS*,t)g(J)dJ. 
Jo 

The simple expansion has a natural interpretation: given a jump size, all jumps starting from 
the inner region and ending in the outer region end up close to each other. If we were going 
to a higher order of accuracy (which we are not), we would have to treat the small jumps 
- those which both start and end in the inner region - separately. So the integral for the 
expectation would have its range split into inner and outer parts, and so on. 

Comparing terms 0(l/5 2 ) and 0(1/5) gives again that wo(x, t) and w\(x, t) vanish, and now, 
at 0(1), 

l 2 (fiw2 j m /O x>x*, 
ox 2 \a 2 W2 x < x*. 

3.3.4. Matching. First, we match the inner solution with the outer solution in the exercise 
region. The matching of for x — > — oo is now to a non- const ant value, but it is clear that 
W 2 (S, t) from (3.15) approaches T*(t) for S — > S* in the 'outer' variables, and matching in an 
overlap region demands that, as x — > — oo, w 2 (x,t) — > —T*(t). Then calculations identical to 
before give, for x < x*, 

w^(x,t) = T*(t) ^ e V2(*-^*)) _ l) , 

and, for x > x*, 

(3.16) w£(x, t) = T*(t) ((x - x*(t)) 2 + V2(x - »*(*))) • 

Matching with the outer region S > S* as before gives 

r%t) = ls*(t) 2 wz ss (t), 

T*(t)(-2x*(t) + V2) = 0, 

F*(t)(x*(t) 2 - V2x*(t)) = W^(t). 

The first equation recovers the jump diffusion gamma from earlier. Interestingly, the relative 
position x* of the penalty crossing point in relation to the exercise boundary, which is given 
by the second equation, is unaffected by the jumps. The last equation, upon inserting x* , 
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shows again that the penalisation error at the free boundary is half the value one would get 
by extrapolation from the outer exercise region. 



4. General Upper and Lower Value Bounds 

In the previous section, we computed the penalisation error to leading order in the penalty 
parameter, and noted a distinct difference in the convergence order for the put, 0(e), and 
butterfly payoffs, 0(e 1/2 ). We now show that a distinction into categories of piecewise smooth 
payoffs with convex and non-convex kinks allows us to derive general upper and lower bounds 
on the value function. We work with jump-diffusion models. 

4.1. A Maximum Principle Argument. Considering the penalised equation 

(4.1) -£ BS jV e = ^max(*-V e ,0), 

it is automatically true that — J~>bsjV € > 0, and if (where) V e > ^, then £bsj^ £ = 0, such 
that a complementarity condition is satisfied and min(— jCbsj^i V € — < 0. Hence, V e only 
fails to be an exact solution to 

min(-£ BS jn^ e -*) <0 

where V e > ^ is violated. 



We begin with an elementary analysis of this inequality constraint. To this end, consider 
W e = V e — ^. The biggest violation of V e > is given at a global negative minimum of 



W e (if one is attained). Note that, for t < T, the solution V t to (4.1) is twice continuously 
differentiable everywhere in S, by standard regularity arguments. We first consider points 
at which \& is also smooth, i.e. excluding the kinks. Then, at any such negative minimum 
S = S* of W e , by inspection of the individual terms, 

dW e 1 o ^d 2 W e dW e 
(4.2) CwW = C ^ f + ^ 2 S 2 ^- + (r-q- w\)S^ ~ rW^ 



+ XE[W 6 {JS,t) - W e (S,t)] > 0. 



From 



€ 



it follows that 

(4.3) F e >* + e£ B SJ^- 

For piecewise linear payoffs it is straighforward to show that, again excluding kinks, £bsj x I / 
is bounded from below, uniformly for all S. Also, W e = V € — does not have any negative 
minima at convex kinks of i.e. points with fy'(S—) < fy'(S+). 



Summarising, the biggest violation of the inequality V e 



> is either bounded by (4.3), taken 

or 



over the smooth intervals of ^, or at concave kinks, or at one of the boundary S 
S — )■ oo. We will come back to this observation later to obtain easily computable bounds on 
the solution. 
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4.2. Constructing Bounds on the Value Function. To treat the solution uniformly in 
the hold and exercise regions, inclusive of kinks, it is convenient to work in the framework of 
viscosity solutions. We use the equation 

(4.4) min(— Cu, u — tp) = 



in log coordinates on R, with C as in (1.8), ip(x) = ^(S) = ^(Sq exp(x)), and its penalised 
version 

(4.5) -Cu € = -max(ift-u £ ,0). 

This avoids technicalities of boundary conditions and discontinuous viscosity solutions, and 
we can use the definition from [22], which we tailor to our setting for convenience: 

Definition 4.1 (Viscosity Solution). w£C([0,T]xl) is a viscosity supersolution (subsolu- 



tion) of ( g-J ), if 

min(-(£0)(x, t),4>(x, t) - tp(x)) > (< 0) 
whenever eft G C 2 ([0,T] xR)n C2([0, T] x M.) and u — (ft has a global minimum (maximum) 
at (x,t) G [0, T) x E toii/i v(x,t) = cft(x,t). u is a viscosity solution iff it is a super- and 
subsolution. 

Here, C 2 is the space of twice continuously differentiable functions, and C2 the space of 
continuous functions with at most quadratic growth at 00. This includes the put and butterfly 
payoffs, but not the call payoff in log-coordinates. It is clear though that the results can be 
extended (e.g. by a coordinate transformation identical to the logarithm for small values, the 
identity for large values, and a smoothly increasing transition in between). We further assume 
that the density v gives bounded third moments. 



Pham |22j shows that under these conditions (1.2) satisfies a comparison principle 



Theorem 4.2 (Theorem 4.1 in [22J). If u and v are uniformly continuous sub- and superso- 



lutions of (1.2) respectively, and u(x,T) < v(x,T) for all x, then u < v everywhere. 
It is clear that u e is a classical subsolution of ( |4.4| ), 

min(— Cu e , u e — ift) = min(e max(i/> — u e , 0), u e — ift) < 0, 
and therefore also a viscosity subsolution, thus u e < u is a lower bound for the true solution. 

We now seek to construct an upper bound by setting 

(4.6) u € := u e + A e , 

(4.7) X e := min{A G R, A > : u e + A > ift} = max{(^ - u 6 ) + }. 



Indeed, u e is a (classical and viscosity) supersolution of (4.4), 

min(— Cu e , u e — ift) = min(rA e + emax(ip — u e , 0),u e — tft) > 0, 
and therefore u e > u. 

This is closely related to the regularised Lagrange multiplier approximation in p3], who 
propose to solve 

— Cu = max ((tp — u)/e + A, 0) 

for some fixed function A > large enough to make the solution feasible. This is possible 
for sufficiently regular obstacles, i.e. for convex kinks in our framework, as it essentially 
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Upper and Lower Bounds for Put Upper and Lower Bounds for Butterfly Spread 




Asset Price Asset Price 



Figure 10. Illustration of the lower bound V e , upper bound V — V 11 + vasx{()$! — 
V e ) + }, payoff ^ , and true value function V for the American put (left) and the 
American butterfly spread (right), for e = 100 (put) and e = 0.00005 (butterfly). We 
are thinking of e as a small number, however for very small values, the bounds for the 
put become optically indistinguishable from the solution. 



corresponds to A e = eA = O(e). In fact, in the following section, we show that the order of 
A e is either 0(e) (no 'active' concave kinks) or 0(e 1 ^ 2 ) ('active' concave kinks), as expected 
from the asymptotic expansions. 



From (4.3) and the discussion thereafter, we know that X e can be estimated from (4.7) by 



using the right-hand side from (4.3) and values at concave kinks and boundaries. We can 
use this fact to compute simple lower and upper bounds, which converge to the true solution. 
Figure [lO] illustrates this for the put and butterfly. Note the different magnitude of the penalty 
parameter required for the put and butterfly to achieve similar accuracy. 

4.3. Convergence Rates and Further Properties. The following results follow directly 
from the comparison principle. 



Lemma 4.3. Let tp and tjj be as before. Denote by u and u the solutions to Problem 4-4 that 
correspo nd t o yj and ip, and, similarly, denote by u e and u e the corresponding solutions to 
Problem 4-5 If 'ip <ip everywhere, then we have 

u e < u e and u <u. 

Trivially, u e and u are nonnegative if ip > 0. Moreover, denote by u €l and u t2 the solutions 



to (4-5) corresponding to penalty parameters e\ > ei > 0, respectively. Then u ei > u t2 . 
Proof. For the first part, consider 

min(— Cu, (j> — u) < min(— Cu, <fi — u) 

and 

—Cu e = - max(-0 — u e , 0) < - max(^ — u e , 0), 
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such that u and u e are subsolutions to their governing equations with ifi replaced by if). 
Similarly, for the second part, apply the same argument to 

—Cu €1 = — max(i/> — u ei , 0) < — max(^/j — u ei , 0) . 
ei e 2 

□ 

We can apply the framework of [15], pp. 4-8, to estimate \ £ . 

Theorem 4.4. If ip is Lipschitz continuous and piecewise C 1 with linear growth and 

(1) convex kinks, then 

< u - u e < Ce; 

(2) concave kinks, then 

< u-u e < Ce l/2 . 

Proof. This follows precisely the steps in the proof of Theorem 2.1 in |15j . Although the 
context there is that of non-linear PDEs, the results are sufficiently abstract to accommodate 
PIDEs given a comparison principle. The main steps are based on smoothing the payoff with 
mollifiers, and bounding the approximation error in the two cases. □ 

5. Solution of a Variational Formulation 

From the previous section, we know max x \u(x,t) — u e (x,t)\ = 0(e) for payoffs with convex 
kinks. Combined with the differentiability of u e w.r.t. x, where the size of the derivative is 
independent of e , 

du _ ujx + e 1 / 2 ) -u(x,t) | ^ 1/2 ^ _ u e {x + e l l 2 ) -u € (x,t) | 1/2 ^ 
dx { ' ~ e 1 / 2 -rv^e )- ^ + u l e I 

allows us to estimate the derivative up to e 1 / 2 by a finite difference, which naturally 'reg- 
ularises' the differentiation. We know e.g. for the put from the asymptotic expansion that 
convergence will be better behaved everywhere except in a small neighbourhood (of width 
e 1 / 2 ) of the exercise boundary. For non-convex kinks, convergence will be slower. 

This section develops estimates of the penalisation error for the derivative directly, via analysis 
in the H 1 norm. We follow here the set-up of [34| . who show convergence of penalisation in 
jump-diffusion models, but do not derive convergence orders. 



5.1. Set-up. We study problems (4.4) and ( |4.5| ), but on a localised domain := {x G R : 
\x\ < 1} with boundary d£l := {i £ R : \x\ = I}. It would be possible to work on M, but this 
would require us to introduces weighted norms to be able to deal with functions that do not 
decay (sufficiently fast) for large x (such that their Sobolev norms are well defined), making the 
variational formulation more cumbersome to write out. Instead, on the finite domain, we can 
use the standard (separable Hilbert) spaces H := L 2 (0) and V := {u G H : du/dx G H} = 
H 1 ^) [2]. For S G {H , V}, define L 2 (0, T; S) as the (separable Hilbert) spaces of measurable 
functions u : [0,T] — > E satisfying u(-,t) G E for almost every t G [0,T] and for which 
\u(-, t)||; dt < co, equipped with their canonical inner products (cf. [19J). The (Banach) 
space L°°(0,T;I7) is defined to contain all measurable functions u : [0,T] — >• E satisfying 
u(-,t) G E for almost every t G [0, T] and for which |u| ioO ( t-e) := esssu Pte[0T] \ u ('^)\s < 00 
(cf-PJ). 
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Now, for u G H and if!!, define the jump operator 

{Bu){x) := A / u(x + z) v{z) dz — u(x) 

J z,z+x££l 

For u, v G V , define the bilinear forms 

. a 2 f dudv f f ( a 2 \du 1 

a(u ' v) = y Ldx- dx dx + L ruv dx -J Q ^- y) d~ x v dx > 

where fi = r — q — coX and 

b(u,v) := — (Bu)v dx. 



We assume in the following that ip : M — > M is continuous and there exists a constant M > 
such that 

(5.1) |V>(x)| < Me M|x| , xei, 

then set 



/(x) := A / ?/>(a; + z) v(z) dz, x G 

and assume that f € H. 

The following two lemmas are taken from |33| . to where we also refer for the proof of the 
subsequent theorem. 

Lemma 5.1. We have B G C(H ,H), i.e. B : H ^ H is a bounded linear operator. 

Proof. See [33]. □ 

Lemma 5.2. There exist constants £ > such that 

a(u, u) + b(u, u) > i?Hy — CMi? > w € V . 
Proof. See [33]. □ 



We are now in a position to formulate the variational inequality the solution of which is the 
value function of the American option. 

Problem 5.3. Find a function u G L 2 (0,T; V), du/dt G L 2 (0,T;H), such that 

u(-,T) = ip, u(x, •) = ip(x) for x G dCl , u > Tp a.e. on Q, x [0, T] 
and, a.e. on [0,T], it is 

(du \ 
— ,v — uj + a(u, v — u) + £>(m, w — u) — (/, v — u) > 

for all v £ V with v > if). 

We emphasise that the payoff function ip is not confined to the interval [—1,1] and that the 
function / is given by a nonlocal integral. 



Theorem 5.4. There exists a unique solution u to Problem 5.3 with u G L°°(0,T; V). 
Proof. See [33] or [31]. □ 



PENALTY APPROXIMATION: NON-SMOOTH PAYOFFS 23 

5.2. Penalisation and Basic Properties. 

Problem 5.5. Let e > and define /3(-) := —(tp — -) + . Find a function u e G L 2 (0,T;V) , 
du e /dt G L 2 (0,T;H), such that 

u e (.,T) = V, u e (x,-) = ^(x) for x £ dn 

and, a.e. on [0, T], we have 

(5.3) a(u e , u) + 6(n e , v) - (/, «) + 1 (/3(u e ), w) = 

/or all v £ V . 



Theorem 5.6. There exists a unique solution u £ to Problem 5.5. Furthermore, there exists 
a constant C > 0, C independent of e, such that 



1 



(5-4) K|i,°°(0,T;V) + -Jfi\P(u e )\ L 2(0,T;H) + 



Ot 



< C. 

L 2 (0,T;H) 



As e — > 0, we have that u e — > u strongly and du e /dt — > du/dt weakly in L 2 (0, T; H), where u 
is the solution to Problem\5A 



Proof. A result of this form comes up naturally when using penalisation to prove the existence 
of a solution to a variational inequality. In this particular case, it can be directly obtained by 
adapting the proof given in [33J for the non-localised problem. Alternatively, one can slightly 
extend a similar result given in [3]. □ 



Remark 5.7. The results of Lemma 4-3 still hold for the localised variational problem, but 
we omit the proof of this. 

5.3. The American Put and Other Payoffs with Convex Kinks. The following result 
is an extension of the one in [3] to jump diffusion, and to accommodate kinks. In the proof, 
we work with weak coercivity instead of coercivity, and account for the loss of regularity at 
kinks explicitly. 

Theorem 5.8. Consider an American put option, i.e. let if} be given by 

yj{x) = {K-e x ) + , xGR, 

and suppose f G H . There exists a constant C > 0, C independent of e, such that 

\u - u e \ L 2 {0tT . v) + \u- u e | L oo (0jT;H) < e 1/2 C. 

Proof. Again, we extend a proof that was given, for standard parabolic variational inequalities 
in Hq , in All constants Ci , with i an integer, are taken to be independent of e and t. 
Plugging —f3(u e ) G V into (|5.3[) gives 



( — P(u e )) -a(u\ /3(u e )) - b(u* , f3(u^)) + (/, f3(u*)) - - (/3(u e ), /3«)) = 0, 



v dt 

which is equivalent to 



(<|(u £ ), f)(u<j) - a{fi{u% fi(u*)) - I (/3(u% /3(u e )) 

= a(V,/3(u e )) +6(n%/3(n e )) - (/,/3(u e )) - Q,^) 
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Note that ip is independent of t. Integrating from t to T, t G [0,T], we obtain 

l-T 



1 



1 



P{u e (t))\ Z H + / a{p(v, e ),0(u e )) ds + -\^)\ 2 L2{t 



T;H) 



(5.5) 



T 



a(^,/3(0) + b{u*,f3(u*)) - (/,/3(n e )) da. 



Recall Lemma 15.11 and note that 



i(ip, f3(u e )) ds 



T ■ a 2 r dip d 



2 J n dx dx 



(tp — u e ) + dx 



(5.6) 
in which 



+ r I ip (tp — u e ) + dx — ((jl— — ) / ^ (tp — u e ) + dx\ds, 

In 2 In OX ) 



dip d 



{ip — u e ) + dx^jds 



dx dx 

As ip is known explicitly, we can write 

dip d 



d 2 ip 
dx 2 



(ip — u e ) + dx^jds. 



(5.7) 
(5.8) 
(5.9) 



r 



dx dx 



(ip — u € ) dx ds 



(ip — u e ) + dxjds 



< 



f (- e x (ip-u e Y 
f (J g e x (ip-u e ) + dx^jds < K J ( J (tp - u e ) + dx^jds. 



result in Remark 



XlogK))' 



5.7 



To get from (5.8) to ( 5.9), we used the following fact: since u e G L (0,T; V), a monotonicity 
gives that u e (s) > for almost every s G [0,T]; hence, (ip(logK) — 

! (log -ftT)) + = almost everywhere on [0, T]. Having observed this, 



applying (5.9) to ( |5.6[ ), we then obtain 

rT 



(5.10) 



( (V,/3(n e )) ds<C \iP\ H i in) \(3(u")\ L 2 {t 



T;H)i 



which, applied to (5.5), gives 

rT i 



a 



ds + -\f3(u<)\ 2 L2 ^ T . H) < Ci^l^^ |/3(n e )| 

L 2 {t,T;H) 

+ \u e \ L \t,T;H) \P{u e )\h\t,T;H) + I/|h \P(u e )\ L \t,T;H) ) • 

The splitting of the integral was necessary because of the kink of ip, whereas for ip\^ G H 2 (Vi) 
the last inequality follows directly by integration by parts. Applying Lemma 5.2 and (5.4) to 
the last expression, we then get 

1, 



(5.11) 



Mu e )\mt,T- H ) <c 2 



for < e < 1. Next, applying Lemma [572] , (5.4) and (5.11) to equation (5.5) yields 

(5-12) \f3(u e )\ L l(t,T;V) + \P(u e )\ L ~{t,T;H) < . 
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2r> 



We define r € := ip — u + (tp — u e )~ , where (tp — u e )~ := — mm{ip — u e , 0}; in particular, this 
means u e — u = r e + f3(u e ). Owing to (5.12), to prove the theorem, it is now sufficient to show 
that 

\ re \L 2 (t,T;V) + \ re \L°°(t,T;H) < ^ C\. 

We set v = r e + u = ^ + (ifj — u e )~ > t/j in (5.2) and v = —r e 6 V in (5.3) and sum the two 
expressions to obtain 

- (— (u - r e ) + a(« - u e , r e ) + &(u -u e ,r e ) + - (/3(u e ), u - V) > 0. 

As — ^(/3(u e ),it — if) > 0, we further get 

-^(u e - u),r e ) + a(u e - u,r e ) + 6(n e - u,r e ) < 0, 



which in return gives 
/dr e 



V dt 



,r*)+a(r*y) + b(r e ,r*) < (^(O, r e ) - a(0(u e ), r s ) - &(/3(u e ), r e ) 



r . 



We define r e := e^r € , where we use £ from Lemma 5.2 Multiplying both sides of the last 
inequality by e 2 ^*, we get 



dr € 
~dt 



r e I + £(r e , r e ) + a{r e , r e ) + b(r e , r € ) 



< (^[^(n £ )],P) - ?(e^(. £ ),P) - a( e % £ ),?) ^(e^^j.P). 

Noting that r e (T) = e^ T [^(T) - u(T) + (ip(T) - u e (T))~] = 0, integrating from t to T, 
[0,T], gives 



(5.13) -1^(01^+/ £(F e ,F e ) +a(r e ,r e ) +b(f e ,f e ) ds 



T 



£(e ?t /3(u £ ),P) +a(e^f3(u e ),?) +b(e & P(u e ),?)ds. 

We now make three observations, which, taken together, will yield the desired result. First, 
according to Lemma 5.2, we have 



ds 



r 



T 



£(r e , r e ) + a(r e , r e ) + b(r e ,r e ) ds > $ / \r e \ v ds. 



Second, according to (5.4), 

(■T , gfe 



< ie^ T \P(u-)\ L ^ T . H) V\ L 2 (t ^. H) +e^ T \^)\ L ^ T . H) 

< C 5 \/3{u e )\ L 2 (t:T . H) \r e \ L 2 {tjT . H) + eC 6 . 



du 



dt 



L 2 (t,T;H) 
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Third, we have 

-(e^« e )(i),f*(i)) 



£(e^(u e ),P) + a(e ?i /3(u e ),P) + b(e^f3(u £ ), ?) ds 

< C r \P(u e )(t)\ H \r e (t)\ H + C 8 |/3(n e )| L2(tjT . y) \r £ \ L ^ T . v) . 
Applying the last three statements as well as (5.12) to (5.13) completes the proof. 



□ 



Finally, we formulate a corollary which states that the result just given for the American put 
also holds for a wider class of functions including a number of traditional option payoffs. 

Corollary 5.9. If there is a finite number of disjoint open intervals Ii : = (xj , ccj+i), < % < 
N, such that [J i=0 [xi , a^i+i] = [—1,1], £ H 2 {Ii) for < i < N, and, additionally, 

dip dip 
lim -7r-(xi) < lim -^-(xi) 

xfxi OX xixi OX 



for 1 < i < N, then the result of Theorem 5.8 also holds. In particular, this includes piecewise 
smooth functions that are convex, a straddle and an American call. 



Proof. Integrating by parts, we can write (5.7) as 

N 



u 



- %i 



N 

E 

i=0 



d 2 ip 
T . dx 2 



(ip — u e ) + dx, 



in which the first term on the right hand side equals 

N 



i=l 



u 



(xi))' 



< 0, 



and we can replace (|5.10) by 

ds < C 7 ^2\ip\ H 2 (Ii) \P(u e )\ L 2 itjT . H) . 



N 



i=0 



Having done this, we then proceed as in the proof of Theorem 5.8 



□ 



At this point, essentially, what we have found is that convex kinks in otherwise smooth 
payoff functions will not affect the known results on the convergence rate of the penalty 
approximation in the H 1 norm, and that these results carry over to jump-diffusion models as 
expected. 



6. Discussion and Applications 

6.1. Summary of Results. In this section, we summarise the estimates on the penalisation 
error obtained in this paper. We focus on two settings: that of convex kinks between other- 
wise smooth (usually linear) payoffs, and that of concave kinks. As concave kinks result in 
lower convergence order, it is clear that in situations with mixed convexity this behaviour is 
dominant. 

Table [3] confirms that the convergence order predicted by asymptotic analysis is in line with 
the numerically estimated one in all situations. 



PENALTY APPROXIMATION: NON-SMOOTH PAYOFFS 27 



Convex kinks Concave kinks 







W l 

' ' oo 


L 2 




Loo 


W v 

" oo 


L 2 




Numerical estimate 


1 


0.55 


1 


0.76 


0.5 


0.07 


0.61 


0.29 


Asymptotic expansions 


1 


0.5 


1 


0.75 


0.5 




0.5 


0.25 


Functional analysis 


1 




0.5 


0.5 


0.5 











Table 3. Order of convergence in the penalty parameter, for different measures and 
payoff types, and as predicted by different methods of analysis. indicates convergence, 
but of no positive order; ' — ' indicates no known result; '* ' indicates no convergence. 
The numerical estimate was obtained by regression of the errors for different numeri- 
cally computed penalised solutions. These and the results under asymptotic expansions 
were computed for representative payoffs (put and butterfly). 



The lack of uniform convergence of the penalty butterfly Delta, denoted by the results 
from the jump of the exact Delta at the strike, which cannot be matched simultaneously on 
both sides by the continuous penalty Delta. However, the asymptotic analysis also reveals 
that the error in the Delta is 0(e 1//2 ) except in a region which is also of width 0(e 1//2 ). 

The rates in the H 1 norm can be explained by the asymptotic analysis as follows: for the 
put, e.g., it gives an error in the derivative, of 0(e 1 ^ 2 ) in the inner region of width 0(e 1 / 2 ), 
resulting in an L 2 error of the derivative of 

^0((e 1 /2)2 £ l/2) = ( e 3/4)_ 

The error in the outer region is integrable and 0{e) (we can just differentiate the outer 
expansion) and therefore negligible. The contribution of the solution to the zero order term 
in the H 1 norm is also of order 1. A similar argument explains the order 1/4 for the butterfly. 

The higher level functional analytic estimates are not sharp in these cases. 



6.2. Interplay Between Penalisation and Discretisation. A comment is due on the 
effect of discretisation on the penalty convergence order, and, conversely, of penalisation on 
the grid convergence order. 

Penalisation of discrete systems. Consider a discretisation with mesh width h and time step 



k, say a central difference fully implicit scheme, then (1.2) becomes 

(6.1) min(Ac — b, x — c) = 0, 

where, for Black-Scholes on equally spaced grid points Si, 1 < i < N, 

1 o ~o , o 3^4-1 2x^ -\- Xi — \ 



(6.2) 



(Ax) 



k-a 2 Sfh 2X ^- 
2 



krbi h krxi, 

2h 



such that A S 



>NxN 



is an M-matrix, b,c,x S 



[we assume xq and xat+i are fixed by 



boundary conditions). The solution of (6.1) for fixed iV is invariant to scaling of the two 



arguments of the min by a positive constant. However, for asymptotic considerations for 



large N, the scaling is crucial for the direct iterative solution of (1.2), as discussed in [12]. 
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Scaling is also relevant for the size of the penalty term. In particular, [TO] consider a penalised 
equation 

Ax e — b = Large max(c — x 6 , 0), 
for a large positive parameter Large, and show that 
(6.3) \\x — x e \\ < C / Large. 

We now elucidate the relation between Large and e. A key estimate on p. 2117 in [TO] is 

\\Ac\\ < const, 

for some positive constant and where c is the payoff, and || • || the maximum norm. For a 
Lipschitz payoff, finite differences for the first derivative, say central differences as per ( |6.2[ ), 
applied to the payoff, are bounded as h — > 0, and second order differences are 0(l/h). So 
as A contains the spatial finite differences multiplied by k, \\Ac\Ioq = m&Xi(A c )i ~ k/h for a 
Lipschitz payoff c, and therefore, as long as k/h is kept fixed, C above is independent of the 
grid sizes. 

This does not contradict the differentiation between convex and concave payoffs found in this 
paper, as for k, h — >■ 0, the above penalised equation is not consistent with a penalised PDE 
with penalty parameter Large, instead, as the "effective" penalty parameter Large/ k in 

1 a f 1 r Large . , . 
-Ax — —b = — : — max c — x , 0) 
k k k 

increases with k — > 0, it is consistent with the obstacle problem itself. 

If we replaced Large by k/e in the above to get a scheme consistent with the penalised PDE, 



(6.3) would still hold, however, the constant C will generally depend on k and h. For the 
butterfly, in particular, C will go to oo for h — > 0, k/h fixed. This reflects the fact that, for 
the butterfly, the limiting continuous problem shows reduced convergence order in e. For the 
put, in contrast, the required penalty parameter is asymptotically independent of the grid 
parameters. Therefore, the analysis of the limiting continuous problem informs the choice of 
penalty parameter for the discretised system. 

Smoothing and discretisation of penalised equations. The penalised PDE does not have a 
(known) closed-form solution and has to be solved numerically. Error estimates for a finite 
element approximation to the penalised heat equation have been given, e.g., by |24| and [29J, 

(6.4) \\u € -u e \\ < (c+^) ■ {k + h 2 ), 



where h is the mesh size and k the time-step of an implicit Euler or ^-method respectively, 
and || • || the L2 norm. 

In contrast, error bounds for the unpenalised problem in |18j have reduced order in k and h, 
(6.5) \\u-u\\ <c - (k 1/2 + h) . 

These results reflect the fact that penalisation smooths the solution. Consequently, the finite 



element error bounds (6.4) deteriorate for decreasing e, and the order of convergence in the 
mesh parameters is lower for the limiting variational inequality. The above results are based on 
the assumption of sufficiently smooth obstacles, such that the penalisation error is determined 
by smooth pasting at the free boundary and not any kinks of the payoff. 



PENALTY APPROXIMATION: NON-SMOOTH PAYOFFS 



29 



Value Delta 



Penalty 


1.0000 


0.4815 


Extrapolation 


2.0061 


1.5015 



Table 4. Order of convergence with respect to the penalty parameter, in the maximum 
norm, for original and extrapolated value and its derivative. The setting is the Black- 
Scholes model with parameters as earlier. 



This technique of smoothing the solutions to non-linear PDEs by penalisation in order to 
derive grid convergence rates for the limiting problem is used in the more general context of 
HJB and Isaacs equations in |15j . 

We should remark that, in practice, one can observe 0(k 3 ^ 2 + h?) convergence also for the 
limiting problem, and even Oik 2 + h 2 ) for a suitably adaptive time stepping scheme, see [10] . 
which accounts for the singular behaviour of the solution close to expiry. 



6.3. Richardson Extrapolation. We now show how extrapolation using the asymptotic 
results can be used to generate more accurate numerical solutions. 

Consider here the American put. We know from Section [3.1| (see also Table [3]) that the leading 
order correction to the penalty solution is proportional to e. So doing the calculation with e 
and 2e, then taking one twice minus the other, 

V e = 2V e - V 2e , 

will be a second order approximation to V (assuming the next term in the expansion is 
quadratic) . 

For the finite difference computation of V £ , we choose a mesh size h ~ e 1 / 2 , for two reasons. 
The (empirically observed) finite difference error is 0{h 2 ), whereas the penalisation error is 
O(e), so the above choice makes both terms the same order of magnitude. Also, although 
the convergence of the penalised PD E so lution is O(e), overlaid is a displacement of the 



exercise boundary by e 1 / 2 (see Section 3.1), at which the penalisation error changes rapidly, 
so extrapolation of the continuous equation (or, in practice, one with very small fixed grid size) 
does not result in an order improvement of the maximum error. However, extrapolation with 
mesh width 0(e 1//2 ) coupled to the penalisation, gives the desired numerical results. This is 
because the 'inner region' is not resolved within grid cells of width 0(e 1//2 ) and therefore does 
not destroy the convergence order of Richardson extrapolation. The results are summarised 
in Table [4j Note that by this procedure we gain a full convergence order in the derivative as 
well. 



The inner (asymptotic) analysis in Section 3.1 is independent of the volatility to the order of 



accuracy we have given. One could use a local volatility model and simply freeze the volatility 
at its local value. So even in non-Black-Scholes models, Richardson extrapolation may be a 
good way of using this to get a more accurate outer put value with little extra effort. The 
strategy should also work for multi-factor models. 
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6.4. Extensions. While the analysis in this article focuses on Black-Scholes and jump- 
diffusion models, the main results, especially of Section [4] and the applicability of extrap- 
olation, should extend to other settings, including local volatility models and derivatives on 
more than one underlying or on an asset modelled by additional stochastic factors, e.g. sto- 
chastic volatility or interest rates. Another interesting extension would be to free-boundary 
problems arising from portfolio selection under transaction costs, but we anticipate especially 
the matched asymptotic expansions to differ more substantially here due to the presence of 
first order derivatives in the penalty term (c.f. |21j). 
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